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DATA COMMUNICATION APPARATUS AND 
DATA COMMUNICATION METHOD 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[0001] The present invention relates to a data communication apparatus and a data 
communication method that communicate by storing non- voice data in a voice packet, in 
a voice communication system that dialogues by exchanging voice packets via an IP 
network. 

2. Description of Related Art 

[0002] In recent years, so-called VoIP (Voice over Internet Protocol) communication 
system has rapidly become available. The VoIP dialogues by exchanging voice packets 
via an IP network. In addition, an IP telephone adaptor apparatus has been introduced, 
the apparatus enabling the continued use of an ordinary PSTN telephone in the VoIP 
communication system. 

[0003] In addition, a data communication apparatus has become known, realizing 
transmission of high resolution image data for Internet facsimile apparatuses, by 
exchanging capability information in accordance with a mail protocol, the Intemet 
facsimile apparatuses performing data transmission by storing image data in e-mail (see 
Related Art 1). 

[Related Art 1] 

Japanese Patent Laid Open Publication 2002-368815 
[0004] Therefore, by utilizing an IP telephone adaptor apparatus similarly to a 
telephone, it becomes possible, in principle, to use the conventional data communication 
apparatus for the PSTN (e.g., facsimile apparatus) in the VoIP communication system. 
However, since the VoIP communication system is optimized for voice communication, 
various problems occur when used for facsimile communication (data communication). 
For example, the VoIP communication system does not retransmit data to ensure the real- 
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time voice reproduction. Therefore, when there is an overflow within a jitter buffer, the 
data is destroyed. When there is an underflow, the data is interpolated based on the data 
immediately before. Such data destruction and interpolation would lower the 
communication quality for data communication, interfering an accurate data 
reproduction. 

[0005] In order to perform data communication utilizing the VoIP communication 
system, the configuration of the IP telephone adaptor apparatus needs to be changed in 
order to be suited to data communication (to accurately reproduce data). When both 
caller apparatus and receiver apparatus have a function specialized for data 
communication, it is possible to achieve high-speed data communication with an 
independently set specification, without being limited to a common standard method for 
the VoIP communication system. This improvement would greatly enhance the 
convenience of the data communication apparatus. 

SUMMARY OF THE INVENTION 
[0006] The present invention is provided based on the above-described notion of the 
inventors. The main purpose of the present invention is to provide a data communication 
apparatus and a data communication method that are configured to perform high-speed 
data communication without being limited to the common standard method for the voice 
communication system. 

[0007] As a solution to the above-described problems, the present invention provides, 
as illustrated in claim I, a data communication apparatus that communicates by storing 
non-voice data in a voice packet, in a voice communication system that dialogues by 
exchanging voice packets via an IP network. The data communication apparatus includes 
a data processor that operates with two switchable modes, a standard communication 
mode, exchanging voice packets at a predetermined transfer rate, and a high-speed 
communication mode, exchanging voice packets at a faster rate than the one of the 
standard communication mode. The data communication apparatus also includes a 
controller that enables a data process execution, upon obtaining the capability 
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information of an opposing apparatus according to a predetermined communication 
protocol, by selecting the high-speed communication mode, when the opposing apparatus 
has the high-speed communication mode. 

[0008] Accordingly, when the opposing apparatus has the high-speed communication 
mode, i.e., when both caller apparatus and receiver apparatus are the data communication 
apparatuses of the present invention, the high-speed data communication becomes 
available, by selecting and executing the high-speed communication mode that exchanges 
packets in a high speed at an independently set transfer rate. When the opposing 
apparatus does not have the high-speed communication mode, i.e., when a conventional 
data communication apparatus is connected to the IP network via an IP telephone adaptor 
apparatus, the standard communication mode is selected and executed. 
[0009] It is preferable to employ a lossless voice encoding method, such as the ITU-T 
recommended G. 71 1. In addition, the present invention can be configured to have 
several types of high-speed communication modes. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0010] The present invention is further described in the detailed description which 
follows, with reference to the noted plurality of drawings by way of non-limiting 
examples of exemplary embodiments of the present invention, in which like reference 
numerals represent similar parts throughout the several views of the drawings, and 
wherein: 

Fig. 1 is a block diagram illustrating a facsimile apparatus with a built-in IP 
telephone adaptor according to an embodiment of a data communication apparatus of the 
present invention; 

Fig. 2 is a timing chart illustrating a data process at the facsimile apparatus of Fig. 

1; 

Fig. 3 is a sequence chart illustrating how capability information is exchanged 
between facsimile apparatuses according to the present invention (shown in Fig. 1); 
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Fig. 4 is a sequence chart illustrating how capability information is exchanged 
between a facsimile apparatus of the present invention (shown in Fig. 1) and a 
conventional apparatus; 

Fig. 5 is a schematic diagram illustrating the jitter buffer of Fig. 1; 

Fig. 6 is a timing chart illustrating a data process at a buffer full state, at the 
facsimile apparatus of Fig. 1; and 

Fig. 7 is a timing chart illustrating a data process at a buffer empty state, at the 
facsimile apparatus of Fig. 1. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
[0011] The embodiments of the present invention are explained in the following, in 
reference to the above-described drawings. 

[0012] Fig. 1 is a block diagram illustrating a facsimile apparatus with a built-in IP 
telephone adaptor according to an embodiment of a data communication apparatus of the 
present invention. In the VoIP communication system that dialogues by exchanging 
voice packets via the IP network, facsimile apparatus 1 stores facsimile data in a voice 
packet and performs a deemed voice communication with facsimile apparatus 2 having 
the same configuration as facsimile apparatus 1, or with IP telephone adaptor apparatus 
(VoIP A) 4, to which a conventional facsimile apparatus 3 is connected. Facsimile 
apparatus 1 includes transmitter/receiver 1 1, VoIP processor 12, facsimile processor 13, 
and controller 14. 

[0013] Upon transmitting data from facsimile apparatus 1, image data, generated when 
scanner 21 (of facsimile processor 13) scans the transmitting document, is input into 
modem 23 via CPU 22. Modem 23 converts the facsimile data into PCM voice data 
according to the G3 facsimile communication protocol. The PCM voice data is first 
encoded by encoder 32 of codec 31 (of VoIP processor 12), according to the regulated 
voice encoding method of the ITU-T recommended G. 71 1 . Then, the data is buiU into a 
voice packet according to the RTP (Real-time Transport Protocol) by RTP processor 33, 
and transmitted to the IP network via transmitter/receiver 1 1 . 
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[0014] Transmitter/receiver 1 1 receives, from the IP network, the voice packet in which 
the facsimile data is stored. Then, RTF processor 33 (of VoIP processor 12) 
disassembles the packet according to the RTP, and analyses the data to confirm the 
exchange of the packet. A voice frame output from RTP processor 33 is temporarily 
stored in jitter buffer 35 of jitter buffer processor 34 and sequentially output to codec 3 1 . 
Codec 31 perfomis voice frame interpolation at voice interpolator 36 and decoding 
process at decoder 37, and outputs the PCM voice data. The PCM voice data is then 
transmitted to facsimile processor 13 and demodulated at modem 23. Then, the obtained 
image data is recorded on a recording media (such as paper) by recorder 24. 
[0015] Codec 3 1 operates with a standard clock generated by internal clock generator 
42, so that the output process of the voice frame at jitter buffer 35 is synchronized with 
the standard clock of clock generator 42. Also, codec 3 1 and modem 23 are digitally 
connected to each other so that modem 23 can synchronize with the standard clock of 
clock generator 42 within codec 3 1 and perform the coding/decoding process. The clock 
generator 42 can adjust the timing of clock signal generation, in accordance with the 
control signal of controller 14, thereby adjusting the operation timing of the output 
process of jitter buffer 35 and the data process of codec 3 1 and modem 23. 
[0016] Fig. 2 is a timing chart illustrating the data process of the facsimile apparatus 
shown in Fig. 1. The facsimile apparatus has a standard communication mode that 
exchanges voice packets at a predetermined transfer rate, which is standard in the VoIP 
communication system, and a high-speed communication mode that exchanges voice 
packets at a higher speed than that of the standard communication mode. The modes can 
be switched when a switch instruction is given by controller 14, to change the generation 
timing of clock generator 42. Accordingly, the process rate of VoIP processor 12 and 
modem 23 is changed. 

[0017] In this example, the processes at VoIP processor 12 and modem 23 are 
performed at process time "t" in accordance with the pulse generated by clock generator 
42. In the standard communication mode shown in Fig. 2 (A), the voice frame is 
processed between the time intervals (e.g., 10 msec), which is standard for the VoIP 
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communication system. Whereas in the high-speed communication mode shown in Fig. 
2 (B), the voice frames are processed with an independently set time intervals (e.g., 5 
msec), at a higher speed than that of the standard communication mode. 
[0018] In the high-speed communication mode, in addition to the transfer rate change, 
the packet transmission and reception operations are synchronized at the caller and 
receiver apparatuses. This is performed by exchanging synchronization signals 
(synchronization information) with the opposing apparatus, similarly to a synchronous 
modem. In the standard communication mode, in contrast, the data is exchanged in 
accordance with a standard clock installed in each apparatus, similarly to a standard 
asynchronous modem. In addition, in the standard communication mode, codec 31 
operates at a sampling rate (8 kHz) regulated by the ITU-T recommended G.71 1, which 
is standard for the VoIP communication system. However, in the high-speed 
communication mode, there is no such limitation and the sampling rate can be 
independently set (e.g., 16 kHz). 

[0019] Fig. 3 is a sequence chart illustrating how the capability information is 
exchanged between facsimile apparatuses of the present invention (shown in Fig. 1). The 
facsimile apparatuses establish a media session in accordance with the SIP (Session 
Initiation Protocol), which is standardized by the IETF. In this media session, voice 
packets, in which facsimile data is stored, are exchanged. During the establishment of 
the media session, in addition to a signaling message regulated by the SIP, capability 
information is exchanged with the opposing apparatus. Upon checking the capability 
information obtained from the opposing apparatus, and when the opposing apparatus has 
the high-speed communication mode, the high-speed communication mode can be 
selected and executed. In addition, the process related to the SIP is performed by 
controller 14. 

[0020] In this example, when the caller apparatus makes a connection request to the 
receiver apparatus according to the SIP protocol, the caller apparatus transmits, to the 
receiver apparatus, capability information relating to both standard and high-speed 
communication modes. In particular, the capability information of its own apparatus 
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(caller apparatus) is added to the transmitting message (INVITE) requesting for 
connection (request for session establishment). To be specific, information including 
voice information relating to a type of encoding method (codec) usable for the caller 
apparatus in the standard communication mode (in this example, information relating to 
encoding performed by //-lawPCM at sampling rate of 8 kHz) and information that the 
caller apparatus has a high-speed communication mode (SYNC_FAX: synchronization 
facsimile) are added to the message requesting for connection (connection request 
message). Therefore, the receiver apparatus is notified that the caller apparatus has the 
high-speed communication mode. 

[0021] When the receiver apparatus receives the connection request from the caller 
apparatus in accordance with the SIP protocol, and finds that the opposing apparatus 
(caller apparatus) has the high-speed communication mode included in the capability 
information (added to the connection request message), the receiver apparatus selects the 
high-speed communication mode and transmits the capability information relating to the 
high-speed communication mode, upon responding to the opposing apparatus with the 
SIP protocol. In particular, the capability information relating to the high-speed 
communication mode (SYNC FAX) is added to a transmitting response message (200 
OK). Accordingly, the caller apparatus is notified that the receiver apparatus has the 
high-speed communication mode. In the established media session thereafter, voice 
packets are exchanged in the high-speed communication mode. 
[0022] Fig. 4 is a sequence chart illustrating how the capability information is 
exchanged between the facsimile apparatus of the present invention (shown in Fig. 1) and 
a conventional facsimile apparatus. In this example, the caller apparatus is facsimile 1 of 
the present invention. The receiver apparatus is IP telephone adaptor apparatus 4 that is 
connected, in the analog method, to conventional G3 facsimile apparatus 3. In this 
example, the caller apparatus adds capability information of its terminal (caller apparatus) 
to the transmitting connection request message (INVITE), similar to the above example. 
Then, the receiver apparatus (not having the high-speed communication mode) ignores 
the information relating to the high-speed communication mode, but acknowledges the 
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standard communication mode having the encoding method supported by its own 
terminal (receiver apparatus). Therefore, the receiver apparatus transmits a response 
message (200 OK) by adding the capability information of the standard communication 
mode. Accordingly, the caller apparatus is notified that the receiver apparatus does not 
have the high-speed communication mode. In the following established media session, 
voice packets are exchanged in the standard communication mode. 
[0023] In the asynchronous data communication, because of the difference between the 
clock frequencies of both caller and receiver apparatuses (due to the individual difference 
of each clock generator), the deviation is caused between the process timings of the caller 
and receiver apparatuses. For example, when the process timing of the caller side is 
faster than the one of the receiver side, the jitter buffer at the receiver side will gradually 
accumulate data, causing an overflow at the end. Conversely, when the process timing of 
the caller side is slower than the one of the receiver side, the jitter buffer at the receiver 
side will gradually lose data, causing an underflow at the end. These overflow and 
underflow problems, caused by the deviation of the mutual process timings, cannot be 
solved by the limited capacity of the buffer. In other words, since the fluctuation of 
intervals between packet arrivals (due to the delay in the IP network) is non-cumulative, 
the fluctuation can be absorbed by a certain capacity of the buffer. However, when the 
changes in the data amount stored in the buffer are cumulative, the jitter buffer cannot 
absorb the changes by itself. 

[0024] Controller 14 in the present facsimile apparatus adjusts the operation timing in 
accordance with the size of data amount stored in jitter buffer 35. In particular, jitter 
buffer processor 34 includes detector 41 that detects a buffer full state and a buffer empty 
state. In the buffer full state, the data amount stored in jitter buffer 35 exceeds a 
predetermined upper limit. In the buffer empty state, the data mount stored in jitter 
buffer 35 falls short of a predetermined lower limit. When the buffer full state is detected 
by detector 41, the operation timing of the output process of jitter buffer 35 and data 
process of codec 3 1 and modem 23 are accelerated. When the buffer empty state is 
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detected by detector 41, the operation timing of the output process of jitter buffer 35 and 
data process of codec 3 1 and modem 23 are decelerated. 

[0025] Fig. 5 is a schematic diagram of the jitter buffer of Fig. 1. Fig. 5 (A) illustrates 
the buffer full state, while Fig. 5 (B) illustrates the buffer empty state. As jitter buffer 35 
sequentially outputs the stored voice frames from the very bottom of the buffer, each 
voice frame moves downward. A newly input voice frame is stored on the very top of the 
stored data. Jitter buffer 35 has a fixed delay area for a predetermined capacity (e.g., 200 
msec) that absorbs the fluctuation of intervals between packet arrivals at the IP network, 
and the delay in the packet exchanges at RTF processor 33. When the data reaches the 
upper limit of the fixed delay area, the data exchange with codec 3 1 starts. 
[0026] In addition, jitter buffer processor 34 sets the upper limit value (first threshold 
value), which is used by detector 41 to detect the buffer full state, at the upper limit of the 
fixed delay area. Jitter buffer 35 includes a congestion control area, above the fixed 
delay area, for a predetermined capacity (e.g., 200 msec) that handles a situation in which 
a cluster of packets stagnant at a router (due to a congested network) arrive in a very 
short period. Jitter buffer processor 34 sets the lower limit value (second threshold 
value), which is used by detector 41 to detect the buffer empty state, at a predetermined 
value (e.g., 100 msec) of the fixed delay area. 

[0027] Fig. 6 is a timing chart illustrating the data process at the buffer full state, of a 
facsimile apparatus of Fig. 1. Fig. 6 (A) illustrates the standard communication mode, 
whereas Fig. 6 (B) illustrates the high-speed communication mode. When the data 
amount stored in jitter buffer 35 is in an ample state, i.e., between the upper and lower 
limits, the data process is performed at a process rate shown in Fig. 2 (time interval 
Ta/Tb). However, when the data amount stored in jitter buffer 35 exceeds the upper 
limit, i.e., at a buffer full state, the operation timing of the output process of jitter buffer 
35 and the data process of codec 3 1 and modem 23 is accelerated. In this example, the 
data process is performed at a rate twice as fast (time interval TaVTb'). Therefore, it is 
possible to restore the jitter buffer into the ample state and securely avoid the overflow at 
the buffer. 
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[0028] Fig. 7 is a timing chart illustrating the data process at the buffer empty state, of a 
facsimile apparatus of Fig. 1. Fig. 7 (A) illustrates the standard communication mode, 
whereas Fig. 7 (B) illustrates the high-speed communication mode. When the data 
amount stored in jitter buffer 35 is in a buffer empty state, falling short of the lower limit 
value, the operation timing of the output process by jitter buffer 35 and the data process 
by codec 3 1 and modem 23 is decelerated. In this example, until the buffer empty state is 
resolved, the output process by jitter buffer 35 and the data process by codec 3 1 and 
modem 23 are temporarily halted. Accordingly, it is possible to quickly restore the buffer 
to the ample state and securely avoid the underflow at jitter buffer 35. 
[0029] In this embodiment, an example is shown using a data communication apparatus 
(facsimile apparatus) with a built-in IP telephone adaptor apparatus, the data 
communication apparatus having transmitter/receiver 1 1, controller 14, RTF processor 
33, jitter buffer processor 34, and codec 3 1 . However, the IP telephone adaptor apparatus 
having the above components can be separated from the data communication apparatus 
and digitally connected to the data communication apparatus. Also, the data exchange 
between codec 3 1 and modem 23 can be performed by a serial transmission, in place of 
the digital pulse code modulation (PCM). 

[0030] It is noted that the foregoing examples have been provided merely for the 
purpose of explanation and are in no way to be construed as limiting of the present 
invention. While the present invention has been described with reference to exemplary 
embodiments, it is understood that the words which have been used herein are words of 
description and illustration, rather than words of limitation. Changes may be made, 
within the purview of the appended claims, as presently stated and as amended, without 
departing from the scope and spirit of the present invention in its aspects. Although the 
present invention has been described herein with reference to particular structures, 
materials and embodiments, the present invention is not intended to be limited to the 
particulars disclosed herein; rather, the present invention extends to all functionally 
equivalent structures, methods and uses, such as are within the scope of the appended 
claims. 
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[0031] The present invention is not limited to the above-described embodiments, and 
various variations and modifications may be possible without departing from the scope of 
the present invention. 

[0032] This application is based on the Japanese Patent Application No. 2003-286358 
filed on August 5, 2003, entire content of which is expressly incorporated by reference 
herein. 
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